Assessing and quantifying inter-rater variation for dichotomous ratings using a Rasch model.
نویسندگان
چکیده
We present a new model-based approach to the analysis of agreement between raters in a situation where all raters have supplied dichotomous ratings of the same cases in a sample. The model is a logistic regression model with random effects--a Rasch model. In the rater setting, the Rasch model includes parameters that allow raters to have different propensities to score a given set of individuals positively or negatively--the rater bias. An exact score test of the hypothesis of no rater bias is proposed and is shown to be an exact generalised McNemar's test. Based on the model, we suggest quantifying the rater variation as a suitable measure of the variation of the rater odds ratios. An important example that will serve to motivate and illustrate the proposed model, is the study of Umbilical artery Doppler velocimetry used by obstetricians to assess the status of a foetus. The purpose of the assessment is to improve the foetus' chance of survival by choosing the optimal time of elective delivery. In the study, data related to 139 perinatal deaths were sent to 32 experts who were asked whether the use of Doppler velocimetry might have prevented each death.
منابع مشابه
Rater Errors among Peer-Assessors: Applying the Many-Facet Rasch Measurement Model
In this study, the researcher used the many-facet Rasch measurement model (MFRM) to detect two pervasive rater errors among peer-assessors rating EFL essays. The researcher also compared the ratings of peer-assessors to those of teacher assessors to gain a clearer understanding of the ratings of peer-assessors. To that end, the researcher used a fully crossed design in which all peer-assessors ...
متن کاملLocal independence and residual covariance: a study of olympic figure skating ratings.
Rasch fit analysis has focused on tests of global fit and tests of the fit of individual parameter estimates. Critics have noted that slight, but pervasive, patterns of misfit to a Rasch model within the data may escape detection using these approaches. These patterns contradict the Rasch axiom of local independence, and so degrade measurement and may bias measures. Misfit to a Rasch model is c...
متن کاملAcoustic and Temporal Analysis for Assessing Speaking
Oral assessment in language learning has received increasing attention among second language acquisition (SLA) researchers. This growing interest is likely a product of the increased interpretability of test scores and potential validity of the scores when linked to real-world criteria (Bonk & Ockey, 2003). However, assessing speaking skill can be more challenging than assessing other skills be...
متن کاملIntra- and inter-rater reliability of the assessment of capacity for myoelectric control.
OBJECTIVE To examine the reliability of the Assessment of Capacity for Myoelectric Control (ACMC) in children and adults with a myoelectric prosthetic hand. DESIGN Intra-rater and inter-rater reliability estimated from reported assessments by 3 different raters. PATIENTS A sample of convenience of 26 subjects (11 males, 15 females) with upper limb reduction deficiency or amputation and myoe...
متن کاملOn the Multivariate Rasch Model: Assessing Collaboration in Multiple Choice Tests
We examine the Rasch model for latent structure para- meters in binary and multiple response questionnaires and develop methodologies and data-analytic tools for assessing collaboration/che- ating in multiple choice tests
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Statistical methods in medical research
دوره 21 6 شماره
صفحات -
تاریخ انتشار 2012